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1.  Introduction 


Knuth,  [11],  Dixon,  [6]  and  [7],  and  Heilbronn,  [8],  have  recently 
investigated  in  considerable  depth  the  average  number  of  divisions  performed 
in  the  Euclidean  algorithm  for  integers.  Although  many  interesting  questions 
remain  unanswered,  the  relatively  elementary  result  of  Dixon  in  [7]  already 
suffices  to  completely  determine  the  average  computing  time  of  the  Euclidean 
algorithm  to  within  a  constant  factor,  which  factor  is  in  any  case  dependent 
on  the  particular  computer  used  and  inessential  details  of  the  implementation. 
Such  a  determination  of  the  average  computing  time  of  the  Euclidean  algorithm 
is  the  main  result  of  the  present  paper.  The  maximum  and  minimum  computing 
times  of  the  Euclidean  algorithm  for  integers  will  also  be  derived  since, 
although  their  determination  is  quite  elementary,  they  have  apparently  not 
previously  been  published.  These  computing  times  are  all  derived  as  functions 
of  three  variables,  namely  the  lengths  of  the  two  inputs  and  the  length  of  the 
resulting  g.c.d.  (greatest  common  divisor).  Previous  results  on  the  computing 
time  of  the  Euclidean  algorithm  ([2]  and  [11],  Section  h.^.2.  Exercise  30)  have 
been  limited  to  upper  bounds  on  the  maximum  computing  time. 


2.  Dominance  and  Codominance 

The  relations  of  dominance  and  codominance  between  real-valued  functions 
were  introduced  in  [3],  where  they  were  used  in  the  analysis  of  the  computing 
time  of  an  algorithm  for  polynomial  resultant  calculation.  The  related  concepts 
and  notation  have  subsequently  been  adopted  by  several  authors,  for  example. 
Brown,  [1],  Heindel,  [9],  and  Musser,  [12].  The  definitions  and  some  funda¬ 
mental  properties  will  be  repeated  here  since  they  will  not  yet  be  familiar  to 
many  readers. 

If  f  and  g  are  real-valued  functions  defined  on  a  conmon  domain  S  we  say 
that  f  is  dominated  b^  g,  and  write  f  <  g,  in  case  there  is  a  positive  real 
number  c  such  that  f(x)<  cg(x)  for  all  x  S.  We  may  also  say  that  g  dominates 
f,  and  write  g  >  f.  Dominance  is  clearly  a  reflexive  and  transitive  relation. 

It  is  important  to  note  that  the  definition  is  not  restricted  to  functions  of 
one  variable  since  the  elements  of  S  may  be  n-tuples. 

Knuth  ([10],  pp.  10fc-108)  defines  f(x)«0(g(x))  in  case  there  is  a  positive 
constant  c  such  that  |f(x)|<  c-|g(x)|.  As  long  as  one  is  dealing  only  with  non' 
negative  valued  functions,  this  formally  coincides  with  the  definition  above  of 
f  ■<  g.  Although  Knuth  implies  that  this  definition  is  applicable  only  when  f 
and  g  are  functions  of  one  variable,  he  in  fact  uses  it  for  functions  of  more 
than  one  variable  (e.g.  [11],  p.  388)  in  a  manner  which  is  consistent  with  our 
definition.  Thus  dominance  is  apparently  a  new  notation  and  terminology  but 
not  a  new  concept.  Although  Knuth  discussed  at  length  the  logical  weaknesses 
of  the  0-notation,  he  chose  not  to  abandon  it  in  favor  of  the  more  natural 
notation  of  an  order  relation. 

If  f  4  g  and  g  A  f  then  we  say  that  f  and  g  are  codominant,  and  write 
f'Vg.  Codominance  is  clearly  an  equivalence  relation.  If  f  1  g  hut  not  g  4  t 
then  we  say  that  f  is  strictly  dominated  b£  g,  and  write  f-<  g.  We  may  also 


say  that  g  strict ly  dominates  f,  and  write  g  >f.  Strict  dominance  is 
clearly  irreflexive  and  transitive.  Whereas  the  O-notation  has  no  counter¬ 
parts  for  the  codominance  and  strict  dominance  relations,  it  will  become 
apparent  that  these  are  important  concepts  in  algorithm  computing  time  analyses. 
Furthermore,  the  O-notation  has  a  somewhat  different  meaning  in  asymptotic  a- 
nalysis  than  the  one  used  by  Knuth  (see,  e.g.,  [ 5] )  ■ 

If  f  and  g  are  functions  defined  on  S  and  is  a  subset  of  S,  it  will 
often  be  convenient  to  write  f  4  g  on  in  case  f^  K  g^,  where  f^  and  g^  are 

the  functions  f  and  g  restricted  to  S,  .  Also,  if  S  C  S.x...xS  ,  a  Cartesian  prod- 
uct,  we  will  denote  by  f  the  function  f  restricted  to  ( {a}  C\  SgX. . .xS^JO  S ;  that  is, 

f  (x  . x^ )  =  f ( a . ,xn)  for  (a.Xg, . . .  ,Xn)  £  S  .  Similarly  we  may  fix  any 

other  of  the  n  variables  of  f. 

Dominance  and  codominance  have  the  following  fundamental  properties,  most 
of  which  were  listed  by  Musser  in  [  12] . 

Theorem  1.  Let  f,  f ^ ,  f,,  g,  g^  and  be  non-negative  real  valued  functions 

on  S,  and  let  c  be  a  positive  real  number.  Then 

(a)  f^cf 

(b)  If  fL  <  gL  and  fg  ^  gg,  then  fj+f^  gj+g2  and  f i  ’  f2i  8j_  *  g2- 

(c)  If  f,  <  g  and  f2  <  g,  then  f j^+f  <  g. 

(d)  max(f  ,g)^  f+g. 

(e)  If  1  f  and  1  4  g,  then  f+g  4  f*g. 

(f)  If  1  4  f ,  then  f^  f+c. 

(g)  Let  S£S,x...xS  and  a  fc  S.  .  If  f  X  g,  then  f  •<  g  . 

i  n  i  —  °  a  —  a 

(h)  Let  S=S^US  , .  If  f  ^  g  on  S  and  f  g  on  S,,  then  f  ■<  g  on  S. 

Proof.  These  properties  follow  immediately  from  the  definition,  except 

for  (e).  To  prove  (e),  assume  1  ^  f  and  1  ^  g  so  that,  for  some  positive  real 

number  c,  cf  >  2  and  eg  >  2.  We  then  have  (cf-2) (cg-2)>  0,  so 

c  fg+4>  2c(f+g)  >  c(f+g)+i+.  Hence  c^fg>  c(f+g),  cfg>  f+g  and  f+g  4  fg.  § 
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3.  Computing  Time  Functions 

Let  A  be  any  algorithm  and  let  S  be  the  set  of  all  valid  inputs  to  A 
(the  elements  of  S  may  be  n-tuples).  We  associate  with  A  a  computing  time 
function  t,  defined  on  S,  t  (x)  being  tho  number  of  basic  operations  per- 
formed  by  the  algorithm  A  when  presented  with  the  input  x,  a  positive  in¬ 
teger.  This  assumes  that  the  algorithm  is  unambiguously  specified  in  terms 
of  some  finite  set  of  basic  operations.  Changing  the  set  of  brsic  operations 
(as  in  reprogramming  the  algorithm  for  a  different  computer)  will  result  in 
changing  the  computing  time  function  tA .  Alternatively,  we  could  take  the 
view  that  this  represents  a  change  in  the  algorithm.  However,  if  and 
are  two  sets  of  basic  operations  such  that  each  operation  in  B^  can  be  per¬ 
formed  by  a  fixed  sequence  of  operations  in  Bg,  and  vice  versa,  then  the  com¬ 
puting  time  functions  associated  with  B^  and  B^.  for  any  algorithm  A  are  co¬ 
dominant,  and  we  will  concern  ourselves  only  with  the  codominance  equivalence 

class  of  t  .  Thus  the  choice  of  basic  operations  is  somewhat  arbitrary.  We 
A 

assume  a  choice  which  is  consistent  with  any  of  the  existing,  or  conceivable, 
random  access  digital  computers  but,  in  order  to  avoid  the  triviality  of 
finiteness,  with  a  memory  which  is  indefinitely  expandable. 

The  function  t.  is  frequently  too  complex  to  be  of  interest  for  direct 

A 

oo  _ 

study.  Instead ,  we  ordinarily  decompose  S  into  a  disjoint  union  S  - 

where  each  S  is  a  non-empty  finite  set,  S  being  a  denumerable  set.  The  choice 
n 

of  decomposition  is  made  on  the  basis  of  some  prior  knowledge  or  some  conjecture 

about  the  general  behavior  of  t^.  Relative  to  a  decomposition  -CsP- (S^SgjS^,  ..  •} 

of  S  we  define  maximum,  minimum  and  average  computing  time  functions,  tA>  tA  and 

t*  on Qf  as  follows,  where  |S  I  denotes  the  number  of  elements  of  S  . 

A  1  n1  n 

tt(SJ-ma*xtS  CA(*h  (1) 


h 


(2) 


fi 

0 

B 

l! 

fl 

0 

i! 

ii 

IJ 

U 

0 

Q 

0 

D 

0 

0 

0 

fl 


t*(Sn)-nllnxtS  'jW' 
n 

tA('»/h„l-  (5) 

n 

As  illustrstion,  and  in  preparation  for  our  analysis  of  the  Euclidean 
algorithm,  let  us  consider  the  computing  times  of  the  classical  algorithms 
for  arithmetic  operations,  that  is,  addition,  subtraction,  multiplication 
and  division,  of  arbitrarily  large  integers.  We  assume  that  all  integers  are 
represented  in  radix  form  relative  to  an  integral  base  ££>2  ,  as  discussed  by 
Knuth  in  [11],  Section  4. 3*  We  know  that  the  computing  times  of  these  al¬ 
gorithms  depend  on  the  lengths  of  the  inputs. 

Following  Musser,  [12],  we  denote  by  L  (a)  the  p-length  of  the  integer  a, 

P 

that  is,  the  number  of  digits  in  the  radix  form  of  a  relative  to  the  base  g. 

If  T xl  is  the  ceiling  function  of  x,  the  least  integer  greater  than  or  equal 


to  x,  we  have 


Lp(aH  lo8p(|a |+i)l. 


00 


for  &f0,  and  we  define  L  (0)=1. 

p 

In  most  contexts  the  base  g  is  fixed  and  we  write  simply  L(a)  for  the 
length  of  a.  The  omission  of  the  subscript  is  further  justified  by  the  ob¬ 
servation  that,  y  being  any  other  base,  we  have 


L-L  , 
P  V 


(5) 


where  L  and  L  are  functions  defined  on  the  set  I  of  all  integers.  In  fact, 
P  V 

we  can  use  the  definition  (4)  when  a  is  any  real  number  and  we  then  have 


Lp(a)~/n(|a|+2)  on  R,  _  (6) 

where  In  is  the  natural  logarithm  and  R  is  the  set  of  all  real  numbers,  and 
(6)  clearly  implies  (5)*  Ihe  length  function  also  has  the  following  easily 
verified  fundamental  properties: 
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L(fl+b)  ^  L(a)+L(b)  for  a,b  £  I, 


I  the  set  of  integers, 

L(ab)/V  L(a)+L(b)  for  a,b£I-{0}»  (8) 

L([a/b])~L(a)-L(b)+l  for  a,b  £  I  and  |a|>|b|X).  (9) 

We  will  also  need  the  following  theorem. 

—  n 

Theorem  2 .  (a)  L(Tf^— i^l^^i^  ^or  a^>,,,»an^  (b)  b(  ^=^a^) 

^  ^>s^L(a^)  for  a^ >  •  • • »an *  I-f-1,0,1}. 

Proof.  L(ab)<  L(a)+L(b)  for  a,b£I,  so  by  induc¬ 

tion  on  n,  proving  (a).  To  prove  (b),  assume  first  that  2<|a^|<p  for  1<  i  <  n. 
Then  L(T^=iai)>  logBT^„1|a1|-(logp2)log2T^Bl|at|>  (logp2)log22n=(logB2)n=(logp2) 

Next,  assume  b^(a^)>  2  for  l<i<n,  and  let  i^L^a^).  Then  b(TT^=^a^)>logB 

(<-iW>  >  s£-iV2-  solJ-i  L(*i> 

Combining  these  two  cases,  we  may  assume  L(a^)“l  for  l<i  <  m  and  L(a^)>  2 
for  m+l<  i  <  n.  Then  £  ".1L(ai)<  (loggpW/^a^  +  2L(^TH-iai)5  2(lo82P) 
{L(T^=1ai)+L(T^=m+1ai)}<  4(log2p)L(TT^=1a1)  since  L(a)+L(b)<  2L(ab)  for 
a ,b  £  I-{0} .1 

It  should  be  noticed  that  a  simple  inductive  proof  of  (b)  was  not 
possible  because  n  is  regarded  as  a  variable,  not  as  an  arbitrary  but  fixed 
positive  integer.  As  an  immediate  corollary  of  Theorem  2,  we  have 

L(a^)~bL(a)  for  a,bCI,|a|>2  and  b>0.  (10) 

If  A,  M  and  D  are  the  classical  algorithms  for  addition  (or  subtraction), 
multiplication  and  division,  respectively,  as  described  in  [11],  Section  4. J, 
then  we  clearly  have 


(11) 


t  (a,b)~L(a)+L(b)  for  a  ,b «  I-fO} , 

A 

>1(a,b)^L(a)-L(b)  for  a  ,b  £  I-fO}  ,  (12) 

tp(a,b)'^  L(b)  •  L([a/b])  for  a,b£  I  and  la|>!b|>0.  (!}) 


Thus,  for  these  algorithms,  the  natural  decomposition  of  the  set 

"f  (a ,b)  :a,b £  i}  consists  of  the  sets  n=f  (a ,b)  :L(a )=m&L(b)":n}  .  If  we 

+  +  -  * 
write  t  (m,n)  in  place  of  t  n),  and  similarly  for  t  and  t  ,  then  from 


and  (13)1  and  using  (9).  we  have 

tA(m,n)'w  tA(m,n)  ^  tA(m,n)~  m+n, 

(14) 

t^Cm.n)^  ^(m.n)  **  tA(m,n)'0  mn, 

(15) 

tD(n,n)v  t  (m,n)  «  tD(m,n)rt<n(m-n+l)  for  m>  n. 

(16) 

Thus  for  these  algorithms  the  maximum,  minimum  and  average  computing 
times  all  coincide.  This  will  not  be  the  case  for  the  Euclidean  algorithm, 
to  which  we  now  ,.urn. 
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4  .  The  Maxinum  and  Minimum  Computing  T Imes  . 

For  simplicity,  and  without  loss  of  generality,  we  will  consider  the 
following  version  of  the  Euclidean  algorithm,  for  wnich  the  permissible  in¬ 
puts  are  the  pairs  (a,b)  of  positive  integers  with  a^.  The  output  of  the 
algorithm  is  the  positive  integer  c,tgcd(a,b). 

Algorithm  E 

(1)  [Initialize.]  c*-  a  ;  d«- b . 

(2)  [Divide.]  Compute  the  quotient  q  and  remainder  r  such  that  c“dq-rr 
and  0<  r  <  d,  using  algorithm  D. 

(3)  [Test  for  end.]  c<-  d  ;  d«-  r ;  if  d^O,  go  to  ( 2) . 

(4)  Return. 

This  algorithm  computes  two  sequences,  (al’a2 . a/+2)  and  (Vq2 . V 

such  that  a^a,  a  -b,  a  "q^a  +ai+?  with  O^^^i+l  for  l-  i-  and  a/+2*°' 

a^,...,a^+^  are  the  successive  values  assumed  by  the  variable  c  and  q^,...,q^ 

are  the  successive  values  assumed  by  the  variable  q.  (a^,...,a  is  called 

the  remainder  sequence  of  (a,b)  and  )  is  called  the  quotient  sequence 

of  (a,b).  Steps  (2)  and  (3)  are  each  executed  l  times;  this  is  the  number  of 
divisions  performed,  which  we  denote  by  D(a,b). 

By  (13)>  the  computing  time  for  the  i11^  execution  of  step  (2)  is 
L(q^)L(a^) .  The  computing  time  for  the  i"*1  execution  of  step  (3)  is 
certainly  dominated  by  L(ai(^  since  at  most  it  requires  copying  the  digits  of 
ai+l  and  ai+f  •  an  im?iementat  ion  of  the  algorithm  in  Which  a  large  integer 
is  represented  by  the  list  of  its  digits  (e.g.  [4])  such  copying  is  unnecessary 
and  the  computing  time  for  each  execution  of  step  (3)  is~l.  For  the  same 
reason,  we  will  assume  that  the  single  executions  of  steps  (1  ard  (4)  have 
computing  times'll.  We  then  have 
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tE(«,b)*£  ii_lL(qi)-L(«1+1).  (17) 

If  Instead  we  were  to  assume  that  copying  is  required  in  steps  (I)  and  (3), 

(17)  would  still  hold  after  adding  L(a^)  to  the  right  hand  side.  But  L(a^)<v' 

L(q^)+L(a^)  £L(q^)L(a^),  so  (17)  holds  in  any  case. 

From  (17)  we  will  derive  the  maximum,  minimum  and  average  computing  times 
of  Algorithm  E,  by  analyzing  the  possible  distributions  of  values  of  the  and 
q^,  obtaining  the  codorainance  equivalence  classes  of  these  computing  times  as 
functions  of  L(a)  ,  L(b)  and  L(c).  Thus  we  consider  the  decomposition  of  into 
the  sets 

Vn,k“t(a*b)  :L(*)"m4L(b)“n^L(8cd(*»b))“k)*  (10) 

with  m  >  n  >  k  >  1.  We  may  verify  that  each  set  S  .  is  non-empty  as  follows. 
“  “  “  cn  1  n  |  k 

If  m-k,  then  (  p"  1 ,  gm  l)‘  S  ,  .  If  m;^c,  let  a“pm  1  and  b«pn  * .  Then 

nij  n  1  k 

k-1 

c“gcd(a,b)*p  ,  L(a)“m,  L(b)*n  and  L(c)“k,  so  (a,b)c  S  ,  .  As  above,  we  will 

cn  1  n  |  k 

write  tg(ii,n,k)  in  Fiace  of  t*(Sm  r  R) ,  and  similarly  for  t^  and  t*. 

Theorem  3.  t*(m,n,k)  A  n(m-k+l). 

XL 

Proof.  Since  b-a2>  a^>  ...>  a^,  wp  hive  by  (17)  that 

tE(«,b)  £  L(b)^ J-1L(qi) .  ( 19) 

Since  L(a)^L(a+l)  for  e  >  1  and  since  q^  >2  we  obtain,  by  Theorem  2, 

£  wL(V^L(Vw(qi+l))*  (20) 

Since  ai"(liai+i+ai+2>  qiai+2+ai+2»  we  bave  cli+l<  ci/ai+2  *or  1  and  bence 
TT 1  ( q i-^! ) <  aia2/a ia,t+l *  Combining  this  with  q  *&  yields 

qfTTi-l(qi+1)  ^  ab/C?<  (20 
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yield 


Since  L(ab/c  )  <  L(a  /c2)*'-*  L(a/c)  ^*L(a)-L(c)+l ,  (19),  (20)  and  (21) 


tE(n,b)  <  L(b)(L(a)-L(c)+l),  (22) 

from  which  Theorem  3  is  immediate.® 

We  now  proceed  to  prove  that  t*(m,n,k)/w  n(m-k+l) ,  for  which  purpose  we 
need  the  following  two  theorems. 


Theorem  4  ■  tg(a,b)  £  D(a ,b) f D(a ,b)+L(gcd(a ,b) ) ) . 


Proof.  Let  (q^,...,q^)  and  (a^,...,a  g)  be  the  quotient  and  remainder 
sequences  of  (a,b),  c“gcd(a,b)  and  k“L(c).  By  (17), 

(23) 

Since  a^+  "0,  a/+l”c  and  ai“<liai+i'fai+2  -  ai+l+ai+2’  a  simple  induction 
shows  that  a^+o_j  >  c^i’  w*lerc  ^  is  the  it^1  term  of  the  Fibonacci  sequence, 
defined  by  F^O,  F^l  and  F^-F^F^ .  But  ([  10] ,  p.  82)  Fi+1  >0V/5,  where 

0“(l+/5)/2,  and  02>/5  so  F^  >0*.  Hence  i-lL^ai^-  Z  i«2lcSf/cIV  - 
i(l°ggC)+^  iogg^1  >  i(l°ggC)+(/22)(log^0).  So  for  k  >  2  and  l  >k, 

Z  i“lL^ai^  >  ^k/+(  l/l6)  ( log^) >  k  /+/'  while  for  k“l  and  /  >4, 

Z  i-lL(ai)  -  (l/l6)(l°g00)i2  >  /£^k/+/‘  .  For  f  <  3,  ^  [MlL(ai)  >  L(c)-k-kz+f 

Zl  2 

^  ^  >  k /+/  for  all  k  and  l,  proving  the  theorem, 

since  f**D(a,b)  .  I 


Theorem  5 .  For  every  positive  integer  n,  there  exist  positive  integers 
e  and  f  with  e  >  f,  L(e)“L(  f  )**n,  gcd(e,f)“l,  and  D(e,f)*n. 

Proof .  Let  be  the  generalized  Fibonacci  sequence  defined  by  F  ^*'*1, 


then  e  >  f  >  v 


] 

] 

] 


I 


F<h  -h,  and  F^«F^+  f[JJ  for  i  >0.  If  e-F^Jj 


n+1  and  f*F 


(h) 


n 


and  F  k ,  F  h  ,  ....  F,h^-h,  F  h^*l,  0  is  the  remainder  sequence  of  (e,f  so 
n+1  n  1  o 

gcd(e,f)“l  and  D(e,f)*n.  Hence  it  suffices  to  show  that  for  every  n  >  1  there 

is  an  h  >  1  such  that  0n_1<  F^<  F  h,,  <  0.  It  can  be  verified  by  calculation 
—  —  n  —  n+i 


"  n  —  n+1 

that  for  n  <6  this  holds  with  h*n. 

Since  F  h'-F  .+hF  for  n  >  1  (see  [ 10] ,  Section  1.2.8,  Exercise  lj)  and 
n  n-1  n  — 

F n«(0n_0n)/y^  Where  0_1-fc(  1-/5)  for  n  >  0  (see  [10],  Section  1.2.8,  Formula 
(lb)),  we  have  |Fn-0n/5|-|0" //5|-(  1^/017/5)0".  But  \$/<f>Y><  .009,  so 
jF  "0n/y5|<  .005  0n  for  n  >  5  and  hence  ^n/^n_i  ^  1*005  0  / *995  0  ^1.  110 

1 .6U  for  n  >  6. 

Assume  as  induction  hypothesis  that  0  <  F^  ^  <  F^.|  <  0  with  h_^nj  6. 

n  (k) 

Let  k  be  the  least  positive  integer  for  which  0  <  Fn+^.  Then  k  >h  and 

FS/F^1,'fFn+kVl>/fV(K-1>Fn+l1<k/(k-l)<7''6’  »°  Fi+i  <  W6)F^1)<(7/6)e". 

Also,  F^/F(k>-tFii+1+kF[i^}/(Fn+kFii+1)<mx(Fn+1/Fn,Fn^/Fii+11  <1.6K,  bo 

<  1.61*F(kj  <  (7/6)(1.61t)nn  <  2b"  <  b"+1  •  Hence  p"  <  F^j  <  F^  <  p" 
n+2  n’t*  i 

and  k  >  h+1  >  n+1,  completing  the  induction.  I 
Theorem  6.  t„(m,n,k)^n(m-k+l) . 

* -  L 

Proof.  By  Theorem  J,  it  suffices  to  prove  that  t*(m,n,k)  >  n(m-k+l  . 

Using  Theorem  5,  choose  e  and  f  with  e  >  f  >  0,  L(e)=L( f )=n-k+l ,  gcd(e,f  l 
and  D(e, f )*n-k+l .  Let  5*f  and  a=e+qf  where  q  is  the  least  non-negative  integer 
such  that  e+qf  >  pm’k.  If  q=0  then  8=e,  m=n  and  L(fl )*=m-k+.l .  If  q=l,  then 
m  >  n  so  a-e+f  <  2e  <  2pn_k+1  <  0n’k+2  <  pm’k+1  and  L(a)=m-k+l.  If  q  >  2  then 
a-e+qf  <  2e+(q-l)f  <  2(e+(q-l)f)<  2pm"k  <  pm-k+1  and  L(a)=m-k+l.  Also,  gcd(a,B)= 

gcd(f ,e+qf )*gcd(e, f )=1  and  D(a ,B)“D(e, f)*n-k+l . 

Let  c*=pk  \  a=Ic  and  b=5c.  Then  c=gcd(a,b),  L(c)=k,  L(a)=m,  L(b)=n  and 

D(a,b)*n-k+l.  Hence  by  Theorem  h  t|(»,n,k)  ^  (n-k+l)  {(n-k+l)+k}  ^n(n-k+l)  . 
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a 

o 

I! 

[J 

II 

LI 

U 

[j 

li 

k  j 

i! 

1! 

LI 

0 

0 

Q 

LI 


Also,  by  (17',  t*(m,n,k)  >  L(q1)L(a2)'«(m-n+l)  (n)  .  So  by  Theorem  1,  part  (c), 

tp(m,n,k)  >  n(n-k+l)+n(m-n+l)*Jn(m-k+l)  .  t 
E 

In  the  next  theorem  we  obtain  the  minimum  computing  time  of  the  Euclidean 

algorithm,  which  is  much  easier. 

Theorem  7.  t„(m,n,k) ^Jn(m-n+l)+k^n-k+l) . 

1  E 

Proof.  By  (17),  t‘(m,n,k)  >  L(q1)L(a2)-^n(m-n+l) .  Since  vtVai+iJ  ”e 
have  qm>  at/a1+1  and  «o  >  trJ.jta^a^J-a/c .  By  (17),  tE(a,b)~ 

lMl(qi)l(*l+l)  ^  >L(c)L(./c)  >L(c)L(b/c)~ 

L(c) {L(b)-L(c)+1} .  Hence  t"(m,n,k)  >  k(n-k+l)  and  by  Theorem  1,  Part  (c) , 

t  (m,n,k)  y  n(m-n+l)+k(n-k+l) . 

E 

If  n=k,  let  a=pm  1  and  b=pn  1  so  that  c=pn  1  and  D(a,b)=l.  By  (17).  this 
shows  that  t  (m,n,k)  -4  n(m-n+l)  ^  n(m-n+l)+k(n-k+l)  . 

If  n  >  k,  let  a=pm"1+pk"1  and  b=Bn"1,  so  that  c=ek_1,  L(a)-m  and  D(*,b)*2. 

Then  by  ( 17 )  ,  t"(m,n,k)  1  n(m-rrH)+k(n-k+l)  for  n  >  k.  Application  of  Theorem  1, 
E 

Part  (h),  concludes  the  proof,  f 


5  •  The  Average  Computing  Time 


As  observed  in  the  proof  of  Theorem  4,  if  a  >  b  and  (a^ag . "/+1»ai+  ) 

is  the  remainder  sequence  of  (u,b),  then  a  >  F  L  >  0f//5.  Since  e  >  /5,  we 
have  l  Infi  >  In  a  +1.  That  is, 

E)(a,b)  <  (in  <f>)  *■(  in  a  +1),  (24) 

with  (fn  <f>)  1 -2.078  .  .  ..  Dixon  established  in  [6]  that  for  every  >0 

|D(a,b)-Tfr.  a|  <  ( l  n  a)^"*"^  (^5) 

for  almost  all  pairs  (a,b)  with  u  >  a  >  b  >  1,  e.s  u  -»  ®,  where 

T=l2TT“2|n  2,  (26) 

and  we  have  t*0. 84276*  •  *.  By  more  elementary  means,  Dixon  proved  in  [7]  the 
weaker  result  that 

D(a,b)  >  ^/n  a  (27) 

for  almost  all  pairs  ( a,b)  with  u>a>b>lasu— ►<*>.  In  the  following,  we 
will  show  how  Dixon's  weaker  result  can  be  used  to  prove  that  the  average 
computing  time  of  the  Euclidean  algorithm  is  codominant  with  its  maximum  com¬ 
puting  time  of  n(m-k+l).  Before  proceeding  to  the  detailed  proof,  however,  I 
shall  present  an  intuitive  sketch. 

It  is  a  well-known  result  (see  [11],  Section  4.5.2,  Excercise  10)  that 

the  proportion  of  pairs  (a,b)  with  u  >  a  >  b  >  1  for  which  gcd(a,b)=l  approaches 
_2 

6tT  as  u  — >  00.  We  will  first  generalize  this  result  to  the  pairs  (a,b)  with 
u>a>b>vasu  -  v->®.  Next  we  set  u  =  pn_k+^  and  v  =  gn"k  and  conclude, 
combining  this  result  with  Dixon's,  that,  for  n-k  large,  at  least  half  of  the 
pairs  (a , b)  for  which  u  >  a  >  b  >  v  satisfy  both  gcd(a,b)=l  and  D(a,b)  >  £fn  a. 
For  each  pair  satisfying  these  conditions  and  each  c  with  gk~^<  c  <  fjk"^  we 
obtain  a  pair  (a,b)-(ac,bc)  with  gcd(a,b)=c,  L(a)=L(b)*=n  and  L(c)=k.  If  m  >  n 
than  from  each  pair  (a,b)  we  obtain  at  least 
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£0m"n  pairs  (a,b)  of  the  form  (aq+6,E)for  which  L(a)=m  and  these  also 
satisfy  L(b)«n,  L(gcd(a,b) )-k  and  D(a,b)  >  £fn  pn'k.  The  pairs  (a,b)  so 
obtained  constitute  at  least  .004g"2  of  all  pairs  in  S  k  and  tE(a,b)  £ 

n(m-k+l)  for  all  (a,b),  so  t*(a,n,k)  £  n(m-k+l)  for  n-k>h,  say.  But  it  is 

trivial  that  t*(m,n,k)  V  n(nv-k+l)  for  n-k<h  for  any  constant  h,  and  so 
E 

t  (m,n,k)'',n(m-k+l)  . 

Theorem  8.  Letu  and  v  be  positive  integers  withu>v,  let  w=  u-v,  and 
let  q  be  the  number  of  pairs  of  integers  (a,b)  such  that u  >  a,b  >  v  and 
gcd(a,b)”l.  Then  jq/V  “6/n2  |  <  (2f  n  w  +  1+ )/w . 

Proof.  Let  be  the  number  of  integers  a  such  that  k|a  and  u>  a  >  v . 


|vk-»/k|<l,  (28) 

and  is  the  number  of  pairs  (a,b)  for  which  k|gcd(a,b)  and  u>a,  b  >v.  By 
the  principle  of  inclusion  and  exclusion, 


q=Ik=i^(k)V 

where^fc  Is  the  Mobius  function.  By  (28), 

.  |vk-w2/k2|<2w/k+l 

Multiplying  (30  byyfc(k)/w  and  summing,  we  have,  by  (29), 

lq/w2'Ik=l^(k)/k2'<(2Hw+1)/w> 

where  Hw  is  the  harmonic  sum^k_^l/k.  Using 

^k=l^(k)/k2=T1^/6 

together  with  (3I)  yields 


Iq/w2-!!2 /6|<(2Hw+1)+^  k=w+l  1/k^ 


Ik 


Z2  (m  -2 

k*w+ll/k  <  J  w  x  an<*  w+^>  which  establishes  the  theorem 

after  substitution  in  (33)  •  I 

Theorem  j.  There  is  a  positive  integer  h  such  that  for  n-k>h,  there 
are  at  least  0.02pcn  ‘  pairs  (a,b)  for  which  gn  k+^  >  a  >  b  >  gn  \ 
gcd(a,b)=l,  and  D(a,b)  >  ^/n  a. 

Proof.  Set  0=^],  v=pn’k,  w=  u-v.  Since  6/T?2  >0.6,  lim 


W— >  60 


(2/n  w  +  4)/w=0,  and  gcd(a,b)=gcd(b,a) ,  by  Theorem  8  there  exists  h^  such  that 
there  are  at  least  0-3  w  pairs  (a,b)  for  which  u>  a  >  b  >  v  and  gcd(a,b)=l, 
for  n-k  >  h^.  By  Dixon's  theorem  there  is  an  h0  such  that  if  n-k  >  h^  then 

D(a,b)  <  iil n  a  for  at  most  0.05  pairs  (a,b)  with  u  >  a,  b  >  1.  Hence  if 

o 

h=max(h^,h2)  and  n-k  >  h  there  are  at  most  (l/4)w~  pairs  (a,b)  for  which 
u  >  a  >  b  >  v,  gcd(a,b)=l  and  D(a,b)  >  £in  a.  The  theorem  follows  since 
w  >  (/P"l)Bn"k  and  (/p-l)2/p  >  (/2-l)2/2  >  0.08.  I 


Theorem  10.  There  is  a  positive  integer  h  such  that  for  n-k>  h,  there 
are  at  least  0.004  pm+n  ^  pairs  (a,b)  such  that  a  >  b,  L(a)=m,  L(b)=n, 

L(gcd(a ,b) )=k  and  D(a,b)  >  ^/n  pn 

Proof.  Choose  an  h  for  which  Theorem  9  holds.  For  every  pair  (a,b) 

k-1  k-i 

satisfying  Theorem  9  and  every  integei  satisfying  p  <  c  <  p  *  we  obtain 
a  pair  (ac,bc)  with  ac  >  be,  L(ac )=L(bc)=n,  L(gcd(ac ,bc) )°L(c)=k,  and 
D(ac  ,bc)=D(a,b)  >  ^/n  a  >  $/n  pn  *C.  The  mapping  f  ( (a,b)  ,c)  =  (ac  ,bc)  thus 
defined  is  one-one  so  there  are  at  least  (0.02p‘;n  2k+^)  (/p-1)  pk  ^  >  0.008g£n 
pairs  (a,b)  with  a  >  b,  L(a )=L(b J°n ,  L(gcd(a,b) )=k  and  D(a,b)  >  £fn  pn  ^ .  if 
m=n  this  completes  the  proof,  so  assume  m  >  n.  For  each  pair  (a,b)  with 
L(a)=L(b)=n  there  are  at  least  [ ( pm - pm  ^)/aj>  (pm-pm  *)/pn  >  (1-p  ^)pm  ° 

^pm  n  pairs  (aq+b,a)  with  L(aq+b)=m.  Since  gcd(aq+b,a)=gcd(a ,b)  and 


D(aq+b,a)=D(a ,b)+l  we  obtain  at  least  (C.008gc  n  k)  (^0m~n)*O  .004  pm+n-k 
pairs  (aq+b ,a)  for  which  aq+b  >a,  L(aq+b)=m,  L(a)=n,  L(gcd(aq+b) ,a) )=k 
and  D(aq+b,a)  >  iin  pn  *t.  § 

Theorem  11.  t  (m,n,k)rw  n(m-k+l) . 

Ci 

Proof.  Let  c^=min( 1 p)  .  By  Theorems  4  and  10,  there  exist  h  and 
c2  >  0  scch  that  tE(a,b)  >  c2D(a,b) fD(a,b)+L(gcd(a,b))}  >  c^^n-k)  fc  (n-k)+k} 

2  4n  W 

>  c  c0n(n-k)  for  n-k>h  and  for  at  least  0.004 fT  "  elements  of  S  .  .  Every 
\  d  m,n,k  3 

element  of  Sm  n  is  of  the  form  (ac,bc)  with  a<  pm"k+1,  b<  pn_k+1  and  c<  pk, 

S0  Sm  n  k  haS  at  most  6  elements.  Hence,  tE(m,n,k)  >  0.004  c^CgP*  n(n-k) 

'v  n(n-k) for  n-k>h.  By  Theorem  7,  t  (m,n,k)  ^  n(m-n+l)  >  n  ^  n(n-k)  for  n-k<  h. 

Ci 

‘ft*  •if- 

Hence  by  Theorem  1,  Part  (h),  t  (m,n,k)  ^n(n-k).  By  Theorem  7,  t  (m,n,k)  > 

Ci  £ 

n(m-n+l)  so  by  Theorem  1,  Part  (c) ,  tE(m,n,k)  V  n(n-k)+n(m-n+l)=n(m-k+l) .  Hence 

-X- 

by  Theorem  6,  t  (m.n.k)^ n(m-k+l)  . 

Cl 
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